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COMPOUNDS RELATED TO PAP-1 

The present invention relates to polynucleotides, herein referred to as CASB4^ 
polynucleotides, polypeptides encoded thereby (referred to herein as CASB4:' 
polypeptides J. recombinant matenals and methods for their production. In another aspect, 
the m\ention relates to methods for using such polypeptides and polynucleotides, including: 
the treatment of cancer and autoimmune diseases and other related conditions. In a funher 
aspect, the mvention relates to methods for identifying agonists and antagonists inhibitors 
using the materials provided by the mvemion, and treating conditions associated with 
CASB47 polypeptide imbalance with the identified compounds. In a stiil further aspect, 
the mvemion relates to diagnostic assays for detecting diseases associated with inappropnate 
CASB47 polypeptide activity or levels. 

Polypeptides and polynucleotides of the present invention are believed to be important 
immunogens for specific prophylactic or therapeutic immunization agamst tumours, because 
they are specifically expressed or highly over-expressed m mmours compared to normal 
cells and can thus be targeted by antigen-specific immune mechamsms leading to the 
destruction of the rumour cell. They can also be used to diagnose the occurrence of rumour 
cells. Furthermore, their inappropriate expression in certain circumstances can cause an 
induction of autoimmune, inappropriate immune responses, which could be corrected 
through appropriate vaccination using the same polypeptides or polynucleorides. In this 
respect the most important biological activities to our purpose are the antigenic and 
immunogenic activities of the polypeptide of the present mvention. A polypeptide of the 
present invention may also exhibit at least one other biological activity of a CASB47 
polypeptide, which could qualify it as a target for therapeutic or prophylactic intervention 
different from that linked to the immune response. 

Functional genomics relies heavily on high-throughput DNA sequencing technologies and 
the various tools of bioinformatics to identify gene sequences of potential interest from the 
many molecular biology databases now available. cDNA libranes ennched for genes of 
relevance to a panicular tissue or physiological situation can be constructed using recently 
developed subtractive cloning strategies. Furthermore, cDNAs found in libranes of certain 
tissues and not others can be identified using appropnate electromc screemng methods. 
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High throughpLit genome- or gene-based b.oiogy aJlows new approaches :o me ident:ricat:or. 
jr.ci domng onarget genes tor usemi mmune responses tor the p^e^ entIO^ and % accme 
tnerap\ of diseases such as cancer and autoimmunity . 

in a nrst aspect, the present invention relates to C.ASB47 polypeptides. Such pertiaes 
include isolated polypeptides comprismg an ammo acd sequence which nas at leasi -O" o 
identit) . preferably at least 80% identity, more preferably at least 90% ident::> , -.et mors 
prererably at least 95% identity, most preferably at least 97.99°o identity , to t.hai of SEQ 
ID NO:; over the entire length of SEQ ID NO::, Such polypeptides include those 
comprising the amino acid of SEQ ID NO:;. 

Funher peptides of the present invention include isolated polypeptides in which the 
amino acid sequence has at least 70% identity, preferably at least 80% identity, more 
preferably at least 90% identity, yet more preferably at least 95% identity, most 
preferably at least 97-99% identity, to the amino acid sequence of SEQ ID N0:2 over the 
entire length of SEQ ID N0:2. Such polypeptides include the polypeptide of SEQ ID 
NO:;. 

Funher peptides of the present invention include isolated polypeptides encoded by a 
polynucleotide comprising the sequence contained in SEQ ID NO: 1 

The invention also provides an immunogenic fragment of a CASB47 polypeptide, that is a 
contiguous pomon of the CASB47 polypept.de which has the same or similar immunogenic 
propenies to the polypeptide compnsing the amino acid seqeunce of SEQ ID NO::. That is 
to say. the fragment (if necessary when coupled to a earner) is capable of raising an immune 
response which recognises the CASB47 polypeptide. Such an tmmunogemc fragment may 
include, for e.xample. the CASB47 polypeptide lacking an N-terminal leader sequence, a ' 
transmembrane domain or a C-tetminal anchor domain. In a preferred aspect the 
immunogenic fragment of CASB47 accordmg to the invention comprises substantially all of 
the e.wacellular domam of a polypeptide which has at least 70% identity, preferably at 
least 80°/o identity, more preferablv at least 90% identity, yet more preferablv at least 
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95°/o identic>'. most preferably at least 97-99% identity, to that of SEQ [D N0:2 o\er the 
entire length of SEQ ID N0;2 

The polypeptides or immunogenic fragment of the invention may be in the form of the 
""maiure" protem or may be a part of a larger protem such as a precursor or a fusion 
protein. It is often advantageous to include an additional amino acid sequence uhicn 
contains secretory or leader sequences, pro-sequences, sequences which aid in 
purification such as multiple histidine residues, or an additional sequence tor stabiiitv 
dunng recombinant production. Furthermore, addition of exogenous polypeptide or lipid 
tail or polynucleotide sequences to increase the immunogenic potential of the final 
molecule is also considered. 

In one aspect, the invention relates to genetically engineered soluble fusion proteins 
compnsing a polypeptide of the present invention, or a fragment thereof, and various 
ponions of the constant regions of heavy or light chains of immunoglobulins of vanous 
subclasses (IgG, IgM, IgA, IgE). Preferred as an immunoglobulin is the constant part of 
the heavy chain of human IgG, particularly IgGl, where fusion takes place at the hinge 
region. In a panicular embodiment, the Fc part can be removed simply by incorporation 
of a cleavage sequence which can be cleaved with blood clotting factor Xa. Furthermore, 
this invention relates to processes for the preparation of these fusion proteins by genetic 
engineering, and to the use thereof for drug screening, diagnosis and therapy. A further 
aspect of the invention also relates to polynucleotides encoding such fusion proteins. 
Examples of fusion protein technology can be found in International Patent Application 
Nos. W094/29458 and W094/22914. 

The proteins may be chemically conjugated, or expressed as recombinant fusion proteins 
allowing increased levels to be produced in an expression system as compared to non- 
fused protein. The fusion partner may assist in providing T helper epitopes 
( immunological fusion partner), preferably T helper epitopes recognised by humans, or 
assist in expressing the protein (expression enhancer) at higher yields than the native 
recombinant protein. Preferably the fusion partner will be both an immunological fusion 
partner and expression enhancing partner. 
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Fusion panners include protein D t'rom Haemophilus influenza B and the non-srraciura! 
proie:n from influenzae virus. NSl ( hemaggJuunm ). .\noiher immunological fusion 
partner is the protein ioiown as LYTA. Preferably the C termmai ponion of the moiecuie 
15 used. Lyia is derived from Sirep(ococcus pneumoniae which synthesize an N-acetyl-L- 
^ alanine amidase. amidase LYTA. (coded by the KiA gene [Gene. -13 ( i986) pa-e 265- 
an autolysin that specitically degrades certain bonds in the pei^tidogivc.in Dackbonc 
The C-terminai domain of the LYTA protein is responsible for the aftmitv to tne choline 
or to some choime analogues such as DEAE. This property has been exploited for the 
deNelopment of E.coli C-LYTA expressing plasmids useful for expression of fusion 
0 proteins. Puntkation of hybrid proteins comaimng the C-LYTA fragment at its amino 
terminus has been described {Biotechnology: 10, (1992) page 795-98;. It is possible to 
use the repeat ponion of the Lna molecule found in the C termmai end siamng at residue 
178. for example residues 188 - 305. 

5 The present invention also mcludes vanants of the aforementioned polypeptides, that is 
poh peptides that vary from the referents by conservative amino acid substitutions. whereb> 
a residue is substituted by another with like charactenstics. Typical such substimtions are 
among Ala, Val. Leu and He; among Ser and Thr: among the acidic residues Asp and Glu: 
among Asn and Gin: and among the basic residues Lys and .Arg; or aromatic residues Phe 
and Tyr Paniculariy preferred are variants in which several, 5-10. 1-5. 1-3. 1-2 or 1 ammo 
acids are substituted, deleted, or added m any combination. 

Polypeptides of the present invention can be prepared in any suitable manner. Such 
polypeptides include isolated namrally occumng polypeptides, recombmantly produced 
polypeptides, synthetically produced polypeptides, or polypeptides produced by a 
combination of these methods. Means for preparing such polypeptides are well understood 
in the an. 

In a funher aspect, the present invention relates to CASB47 polynucleotides. Such 
polynucleotides include isolated polynucleotides compnsmg a nucleotide sequence encoding 
a polypeptide which has at least 70% identity, preferably at least 80% identity, more 
preferably at least 90% identity, yet more preferably at least 95% identity, to the amino 
acid sequence of SEQ ID N0:2. over the emire length of SEQ ID N0:2. In this regard. 

4 
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poiypep[ides which has e at least 97% identic are highly preferred, whilst those u ;th at leas: 
98-99'^ 0 identir>' are more highly preferred, and those with at least 99*^/0 identit>' are most 
highJy preferred. Such polynucleotides include a polynucleotide comprising the nucleotide 
sequence contained in SEQ ID N0:1 encoding the polypeptide of SEQ ID NO;: 

Further polynucleotiaes of the present invention include isolated polynucleotides comprising 
a nucleoude sequence that has at least 70% idemit>-. preferably at least 80% identity, more 
preferably at least 90% identity, yet more preferably at ieasi 95% identity, to a nucleotide 
sequence encoding a polypeptide of SEQ ID N0:2. over the entire coding region In this 
regard, polynucleoudes which have at least 97% identity are highJy preferred, whilst those 
with at least 98-99% identity are more highly preferred, and those with at least 9Q% ldentlt^ 
are most highly preferred. 



Further polynucleotides of the present invention include isolated polynucleotides 
i : comprising a nucleotide sequence which has at least 70% identity, preferably at least S0% 
identity, more preferably at least 90% identity, yet more preferably at least 95''o identir>-. 
to SEQ ID NO: 1 over the entire length of SEQ ID NO: 1 . In this regard, polynucleotides 
which have at least 97% identity are highly preferred, whilst those with at least 98-99% 
idenii\- are more highly preferred, and those with at least 99% identity are most highU 
20 preferred. Such polynucleotides include a polynucleotide comprising the polynucleotide of 
SEQ ID NO: 1 as well as the polynucleotide of SEQ ID NO: 1 . Said polynucleotide can be 
insened in a suitable plasmid or recombinant microrganism vector and used for 
immunization ( see for example Wolff et. al.. Science 247:1465-1468 (1990); Corr ei. al., J. 
Exp. Med. 184:1555-1560 (1996): Doe ei. al., Proc. Natl. Acad. Sci. 93:8578-8583 ( 1996)). 
25 The invention also provides polynucleotides which are complementary to all the above 
described polynucleotides. 



The invention also provides a fragment of a CASB47 polynucleotide which when 
administered to a subject has the same immunogenic properties as the polynucleotide of 
30 SEQ ID NO: I. 



The invention also provides a polynucleotide encoding an immunological fragment of a 
C.ASB47 polypeptide as hereinbefore defined. 
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The nucleotide sequence of SEQ ID NO: 1 shows homology with mouse PAP- \ (GenBank 
accession D78255 ). The nucleotide sequence of SEQ ID N0:1 is a cDNA sequence and 
compnses a polypeptide encoding sequence (nucleotide 25 to 690) encodmg a pois peptide 
of 22 1 ammo acids, the polypeptide of SEQ ID N0:2. The nucleotide sequence encodmg 
the polypeptide of SEQ ID N0;2 may be identical to the polypeptide encoding sequence 
contamed in SEQ ID NO; 1 or it may be a sequence other than the one contained m SEQ 
ID NO: 1, which, as a result of the redundancy (degeneracy) of the genetic code, also 
encodes the polypeptide of SEQ ID N0:2. The polypeptide of the SEQ ID N0:2 is 
strucrurally related to other proteins of a novel family, havmg homology and or structural 
similarity' with mouse PAP-1 (accession Dl 01 1981). 

Preferred polypeptides and polynucleotides of the present mvemion are expected to have. 
mrer alia, similar biological functions/ propenies to their homologous polypeptides and 
polynucleotides. Furthermore, preferred polypeptides, immunological fragments and 
polynucleotides of the present invention have at least one activity of either SEQ ID NO: 1 or 
SEQ ID N0:2- as appropriate. 

The present invention also relates to panial polynucleotide and polypeptide sequences which 
were tlrst identified pnor to the detenmination of the corresponding foil length sequences of 
SEQ ID NO: 1 and SEQ ID N0:2. 



-Accordingly, m a further aspect, the present invention provides for an isolated 
polynucleotide which; 

:5 I a) comprises a nucleotide sequence which has at least 70% identity, preferably at least 
80% identity, more preferably at least 90% identity, yet more preferably at least 95% 
identit> . even more preferably at least 97-99% identity to SEQ ID N0:3 over the entire 
length of SEQ ID N0:3; 

(b) has a nucleotide sequence which has at least 70% identity, preferably at least 80% 
.'0 ;dentit> . more preferably at least 90% identity, yet more preferably at least 95% identit) . 
even more preferably at least 97-99% identity, to SEQ ID NO: 1 over the enure length of 
SEQ ID .\'0:3: 

(c> the polynucleotide of SEQ ID N0:j; or 

6 
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( d) a nucleotide sequence encoding a polypeptide which has at least 70% identit> . 
preferably at least 80% identity, more preferably at least 90% identity, vet more 
preferably at least 95% identity, even more preferably at least 97-99% identitv. to the 
amino acid sequence of SEQ ID N0;4. over the entire length of SEQ ID N0:4. 
as well as the polynucleotide of SEQ ID NO:j. 

The present m\'emion further provides for a polypeptide which; 

(a) comprises an ammo acid sequence which has at least 70% identity preferabiv at least 
80^^/0 identity, more preferably at least 90% identity, yet more preferabiv at least *^5°o 

;u identity, most preferably at least 97-99% identity, to that of SEQ ID NO;2 over the entire 
length of SEQ ID NO:4; 

(b) has an amino acid sequence which is at least 70% identity, preferabiv at least 80% 
identity, more preferably at least 90% identity, yet more preferably at least 95% identity, 
most preferably at least 97-99% identity, to the amino acid sequence of SEQ ID N'0;2 

1 over the entire length of SEQ ID N0:4; 

(c) compnses the amino acid of SEQ ID N0:4; and 

(d) is the polypeptide of SEQ ID N0:4; 

as well as polypeptides encoded by a polynucleotide comprising the sequence contained 
in SEQ IDN0:3. 

:o 

The nucleotide sequence of SEQ ID N0:3 and the peptide sequence encoded thereby are 
derived from EST (Expressed Sequence Tag) sequences. It is recognised by those skilled 
in the an that there will inevitably be some nucleotide sequence reading errors in EST 
sequences (see Adams, M.D. et ai Nature 377 (supp) 3. 1995). Accordingly, the 
25 nucleotide sequence of SEQ ID N0:3 and the peptide sequence encoded therefrom are 
therefore subject to the same inherent limitations in sequence accuracy. Funhermore. the 
peptide sequence encoded by SEQ ID NO:3 compnses a region of identity or close 
homology andor close structural similarity (for example a conservative amino acid 
difference) with the closest homologous or structurally similar protein. 

30 

Polynucleotides of the present invention may be obtamed. using standard cloninii and 
screening techniques, from a cDNA library denved from mRNA in cells of human colon 
cancer, (for e.xample Sambrook et al.. Molecular Cloning: A Laboratory Manual. 2"^ Ed.. 
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Co:d Spnng harbor Laborator>- Press. Cold Spnng harbor. N.V. (1989)). Polvnucleotides of 
the invention can also be obtained from natural sources such as genomic DNA libraries or 
can be synthesized using well known and commercially available techniques. 

VVnen polvnucteondes of the present invention are used for the recombinant production of 
polypeptides of the present invention, the polynucleotide may include the coding sequence 
for the mature pol> peptide. b>' itself; or the coding sequence for the mature polypeptide in 
reaaing frame with other coding sequences, such as those encoding a leader or secreton. 
sequence, a pre-, or pro- or prepro- protein sequence, or other ftision peptide portions, For 
example, a marker sequence which facilitates punfication of the fused polypeptide can be 
encoded. In certain preferred embodunents of this aspect of the invention, the marker 
sequence is a hexa-histidine peptide, as provided in the pQE vector (Qiagen. Inc.) and 
descnbed in Geniz etal.. Proc Natl Acad Sci USA (1989) 86:821-824. or is an HA tag The 
polynucleotide may also contain non-coding 5* and 3' sequences, such as u-anscnbed. non- 
translated sequences, splicing and polyadenylation signals, ribosome binding sites and 
sequences that stabilize mRNA. 



Fmrher embodiments of the present invention include polynucleotides encoding polypeptide 
vanants which compnse the amino acid sequence of SEQ ID N0;2 and in which several, for 
20 instance from 5 to 10. 1 to 5. 1 to 3. 1 to 2 or 1, ammo acid residues are substituted, deleted 
or added, in any combination. 



Polynucleotides which are identical or sufficiently identical to a nucleotide sequence 
contained in SEQ ID N0:1. may be used as hybridization probes for cDNA and genomic 

25 DNA or as pnmers for a nucleic acid amplification (PGR) reaction, to isolate hill-length 
cDNAs and genomic clones encoding polypeptides of the present invention and to isolate 
cDN.A and genomic clones of other genes (including genes encoding paralogs from human 
sources and orthologs and paralogs from species other ±aj\ human) that have a high 
sequence similanry to SEQ ID NO: 1 . Typically these nucleotide sequences are 70% 

30 identical, preferably 80% identical, more preferably 90% identical, most preferably 95° o 
identical to that of the referent. The probes or pnmers will generally compnse at least 1 5 
nucleotides- preferably, at least 30 nucleotides and may have at least 50 nucleotides. 
Panicuiarly preferred probes will have between 30 and 50 nucleotides. PanicularK 

8 
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preferred pnmers will have between 20 and 25 nucleotides. In panicuiar, polypeptides or 
polynucleotides denved from sequences from homologous ammai ongm could be used as 
immunogens to obtain a cross-reactive immune response to the human sene. 

A poiynucleotide encodmg a polypeptide of the present invention, including homologs rrom 
species other than human, ma} be obtamed by a process which compnses the steps of 
screenmg an appropnate Ubrar>- under stnngent hybndization conditions with a labeled 
probe ha\'mg the sequence of SEQ ID NO: 1 or a fragment thereof: and isolating full-lenuth 
cDNA and genomic clones containing said polynucleotide sequence. Such hybridization 
techniques are well known to the skilled artisan. Preferred stnngent hybndization conditions 
include overnight incubation at 42^C in a solution comprising: 50% formamide. 5xSSC 
(ISOmM NaCl. 15mM trisodium citrate). 50 miM sodium phosphate {pH7.6). 5x Denhardfs 
solution. 10 % dextran sulfate, and 20 microgram/ml denatured, sheared salmon sperm 
DNA; followed by washing the filters in 0.1 x SSC at about 650C. Thus the present 
invention also includes polynucleotides obtainable by screening an appropriate librar\' 
under stingent hybridization conditions with a labeled probe having the sequence of SEQ ID 
NO: 1 or a fragment thereof. 

The skilled artisan will appreciate that, in many cases, an isolated cDNA sequence will be 
incomplete, in that the region coding for the polypeptide is shon at the 5' end of the 

cDNA. 

There are several methods available and well known to those skilled in the an to obtain 
full-length cDNAs. or extend shon cDNAs, for example those based on the method of 
Rapid Amplification of cDNA ends (R.ACE) (see, for example, Frohman et al.. PNAS 
USA 85, 8998-9002. 1988). Recent modifications of the technique, exemplified by the 
Marathon^^ technology (Clontech Laboratories Inc.) for example, have significanth 
simplified the search for longer cDNAs. In the Marathon^^ technology. cDNAs have 
been prepared from mRNA extracted from a chosen tissue and an 'adaptor' sequence 
iigated onto each end. Nucleic acid amplification (PGR) is then earned out to amplify- the 
missmg' 5' end of the cDNA using a combination of gene specific and adaptor specific 
oligonucleotide pnmers. The PGR reaction is then repealed using nested' pnmers. that is. 
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primers designed to anneal vvuhin the amplified product uyp,cali> an adaptor srecric 
primer that anneals ftinher 3' in the adaptor sequence and a gene speofic primer that 
anneals t^rther 5' in the knouT, gene sequence,. The products ot this reaction can then be 
analvsed b> DNA sequencing and a tlill-length cDNA consiracted either by lo.r.ing the 
prod'act directly to the existing cDNA to give a complete sequence, or carr. in; oiit'a 
separate tull-length PCR using the new sequence information tor the design ot :,ie < 
primer. 

Recombinant pol> peptides of the present invention may be prepared b> processes u ell 
knovvT. m the art tVom geneucalK- engineered host cells compnsing expression systems. 
Accoramgh-. in a tiinher aspect, the present invention relates to an expression system which 
compnses a polynucleotide of the present invention, to host cells which are geneticalh 
engineered with such expression sytems and to the production of polypeptides of the 
invention by recombinant techniques. Cell-tree translation systems can also be employed to 
produce such proteins using RNAs derived from the DNA constructs of the present 



invention. 



For recombinant production, host cells can be genetically engmeered to incorporate 
expression systems or ponions thereof for polynucleotides of the present inNention. 
Introduction of polynucleotides into host cells can be effected by methods descnbed in many 
standard laboratory manuals, such as Davis et al.. Basic .Methods in .Vtolecular Biology 
( 1 986) and Sambrook e, al.. Molecular Cloning: A Laboratory Manual. 2nd Ed.. Cold 
Sprmg Harbor Laborator>' Press. Cold Spnng Harbor. N.Y. (1989). Preferred such methods 
include, for instance, calcium phosphate transfeciion. DEAE-de.xtran mediated transfection. 
transvection, microinjection, cationic lipid-mediated transfection. electroporation. 
transduction, scrape loading, ballistic introduction or infecuon. 

Preferabh- the proteins of the invention are coexpressed with thioredoxin in trans fTIT). 
Coe.xpression of thioredoxm m trans versus in cis is preferred to keep antigen free of 
thioredoxin without the need for protease. Thioredoxm coexpression eases the 
solubilisaiion of the proteins of the invention. Thioredoxm coexpression has also a 
significant impact on protein purification yield, on purified-protein solubility and qualit> 
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Representative examples of appropnate hosts include bactenai cells, such as Strcpiococci. 
Sfaphvlococc:, E. coii, Sfrepiomyccs and Bacillus subtilis cells: Ifungal cells, such as veasi 
cells and Aspergillus cells; insect cells such as Drosophila S2 and Spodooiera Sf) cells: 
ommai cells such as CHO. COS. HeLa. CI27. 3T3. BHK. HEK 293 and Bowes melanoma 
cells: and plant cells. 

A great variety oi' expression systems can be used, for instance, chromosomal, episomai 
and v:rus-derivea systems, e.g.. vectors derived from bacterial plasmids. from 
bacteriophage, from transposons. from yeast episomes. from msenion elements, from 
yeast chromosomal elements, from viruses such as baculoviruses, papova viruses, such as 
SV40. \accinia \'iruses, adenoviruses, fowl pox viruses, pseudorabies viruses and 
retroviruses, and vectors derived from combinations thereof, such as those derived from 
plasmid and bactenophage genetic elements, such as cosmids and phagemids. The 
expression systems may contain control regions that regulate as well as engender 
expression. Generally, any system or vector which is able to maintain, propagate or 
express a poiynucieotide to produce a polypeptide in a host may be used. The appropnate 
nucleotide sequence may be inserted into an expression system by any of a variety of 
well-known and routine techniques, such as, for example, those set forth in Sambrook et 
ai. Molecular Cloning. A Laboratory Manual (supra). Appropriate secretion signals may 
be incorporated into the desired polypeptide to allow secretion of the translated protein 
into the lumen of the endoplasmic reticulum, the periplasmic space or the extracellular 
environment. These signals may be endogenous to the polypeptide or thev mav be 
heterologous signals. 

The expression system may also be a recombinant live microorganism, such as a virus or 
bacterium. The gene of mterest can be inserted into the genome of a live recombinant 
virus or bacterium. Inoculation and in vivo infection with this live vector will lead to in 
VIVO expression of the antigen and induction of immune responses. Viruses and bacteria 
used for this purpose are for instance: poxviruses (e g: vaccinia, fowlpox. canary pox), 
alphaviruses (Sindbis virus. Semliki Forest Virus, Venezuelian Equine Encephalitis 
Virus), adenoviruses, adeno-associated virus, picomaviruses (poliovirus. rhinovirus). 
herpesviruses (varicella zoster virus, etc). Listeria, Salmonella . Shigella, BCG These 



wo 9^/49030 



PCT EP99 0189: 



V iruses and bacteria can be virulent, or anenuaied in vanous ways in order to oDtain iive 
vaccines. Such li\'e vaccines aiso form pan of the invenrion. 

Poiypeptides of tne present invention can be recovered and punned from recombinant cell 
r culrures v^ell-knowTi methods including ammonium sulfate or ethanol precipitation, acid 
extraction, aruon or cation exchange chromatography, phosphoceliulose chromatocraph> . 
hydrophobic interaction chromaiograph\ , afrmiiy chromatography, hvdroxyiapatitc 
chromatography and iecun chromatography .Most preferably, ion metal aftlnjt> 
chromatograpny (IMAC) is employed for punficaiion. Weil known techniques for 
■ refolding proteins may be employed to regenerate active conformation when the pol vpeptide 
is denatured dunng mtraceliuiar synthesis, isolation and or purification. 

Another important aspect of the invention relates to a method for inducing . re-inforcins 
or modulating an immunological response in a mammal which comprises inoculating the 

15 mammal with a fragment or the entire polypeptide or polynucleotide of the invention, 
adequate to produce antibody and/or T cell immune response for prophylaxis or for 
therapeutic treatment of cancer and autoimmune disease and related conditions. Yet 
another aspect of the invention relates to a method of inducmg, re-inforcing or 
modulating immunological response in a mammai which comprises, delivering a 

20 polypeptide of the present invention via a vector or cell directing expression of the 
polynucleotide and coding for die polypeptide in vivo in order to induce such an 
immunological response to produce immune responses for prophylaxis or treatment of 
said mammal from diseases. 



A further aspect of the invention relates to an immunological/vaccine formulation 
I composition) which, when introduced into a mammalian host, induces, re-inforces or 
modulates an immunological response in that mammal to a polypeptide of the present 
mvention wherein the composition comprises a polypeptide or polynucleotide of the 
mvention or an immunological fragment thereof as herein before deflned.The vaccine 
formulation may tlirther compose a suitable carrier. Since a polypeptide may be broken 
down m the stomach, it is preferably administered parenteraily (for instance, 
subcutaneous, intramuscular, intravenous, or intradermal injection). Formulations 
suitable for parenteral administration include aqueous and non-aqueous sterile injection 
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soiutions which may contain anti-oxidants. buffers, bactenostats and solutes v\hich renaer 
the formulation isotonic with the biood of the recipient; and aqueous and non-aqueous 
sterile suspensions which may include suspending agents or thickening aeen:s The 
formulations ma> be presented in unii-dose or multi-dose containers, for example, sealed 
ampoules and vials and may be stored in a freeze-dried condition requiring on[\ rhe 
addition of the sterile liquid earner immediately prior to use. 

A rurther aspect of the invention relates to the m vitro mduction of immune responses to a 
tragment or the entire polypeptide or polynucleotide of the present invention or a 
molecule comprising the polypeptide or polynucleotide of the present invention, usine 
cells from the immune system of a manunai. and reinfusing these activated immune cells 
of the mammal for the treatment of disease. Activation of the cells from the immune 
system is achieved by in vitro incubation with the entire polypeptide or poivnucleotide of 
the present invention or a molecule comprising the polypeptide or polynucleotide of the 
present invention in the presence or absence of various immunomoduiator molecules. 
A funher aspect of the invention relates to the immunization of a mammal by 
administration of antigen presentmg cells modified by in vitro loading with part or the 
entire polypeptide of the present invention or a moiecule comprising the polypeptide of 
the present invention and administered in vivo in an immunogenic way. Altematively. 
antigen presentmg cells can be transfected in vitro with a vector containing a fragment or 
the entire polynucleotide of the present invention or a molecule comprising the 
polynucleotide of the present invention, such as to express the corresponding polypeptide, 
and administered in vivo in an immunogenic way. 

The vaccme formulation of the invention may also include adjuvant systems for 
enhancing the tmmunogenicity of the formulation. Preferably the adjuvant system raises 
preferentially a THl type of response. 

.An immune response may be broadly distinguished into two extreme catacories. being a 
humoral or ceil mediated immune responses (traditionally characterised by antibody and 
cellular effector mechanisms of protection respectively). These categories of response 
have been termed THl -type responses (cell-mediated response), and TH2-tvpe immune 
responses (humoral response). 

13 
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Extreme THl-r>-pe iminune responses may be characterised by the generation ot'antiiien 
specific, haplotype restricted cnotoxic T lymphocytes, and natural killer ceil responses, 
in mice THl-type responses are orten characterised by the generation of antibodies of the 
igG:a subtype, whilst in the human these correspond to IgGi type antibodies. TKI-type 
immune responses are charactensed by the generation of a broad ranee of 
immunogiobuim isotypes inciuding in mice IgGl. IgA. and Ig.M. 

It can be considered that the dnvmg force behind the development of these two types of 
:mmune responses are cytokmes. High levels of THl-type cytokines tend to favour the 
induction of ceil mediated immune responses to the given antigen, whilst high levels of 
TH2-type cytokines tend to favour the induction of humoral immune responses to the 
anti2en. 



1 --^ The distinction of TH 1 and TH2-type immune responses is not absolute. In reality an 
individual will support an immune response which is described as being predominantlv 
THl or predommantiy TH2. However, it is often convenient to consider the families of 
cytokines in terms of that descnbed in murine CD4 +ve T ceil clones by Mosmann and 
Coffman (Mosmann, T.R. and Coffman, R.L (1989) THl and TH: cells: different 

20 patterns ofiymphokine secretion lead to different functional properties. Annual Review of 
Immunology. pl4j-l73). Traditionally. THl-type responses are associated with the 
production of the [NF-y and 11-2 cytokmes by T-lymphocytes. Other cytokines often 
directly associated with the induction of THl-type immune responses are not produced by 
T-ceiis. such as IL-12. In contrast, TH2- type responses are associated with the secretion 

25 of IL-4. [L-5, IL-6 and IL-13. 

[t is known that certain vaccine adjuvants are particularly suited to the stimulation of 
either THl or TH2 - type cytokine responses. Traditionally the best indicators of the 
TH1:TH2 balance of the immune response after a vaccination or infection includes direct 
30 measurement of the production of THl or TH2 cytokines by T lymphocytes m vitro after 
restimuiation with antigen, and or the measurement of the IgG I :IgG2a ratio of antigen 
specific antibody responses, 
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Thus, a THl -type adjuvant is one which preferemiaJly stimulates isolated T-celi 
populations to produce high levels of THl -type cytokines when re-stimuiated wuh 
antigen m vitro, and promotes development of both CDS- cytotoxic T lymphocytes and 
antigen specific immunoglobulin responses associated wuh THl-tvpe isotvpe. 

AdiLivants which are capable of preferential stimulation of the THl cell response are 
descrined in International Patent Application No. U'O 94. 00153 and WO 9^ P:0^) 

3 De-O-acylated monophosphor>'l lipid A f3D-MPL) is one such adjuvant. This is known 
1 1) from GB 22202 1 1 (Ribi i. Chemically it is a mixture of 3 De-O-acylated 

monophosphorv'l lipid A with 4, 5 or 6 acylated chains and is manufactured bv Ribi 
Immunochem. Montana. A preferred form of 3 De-O-acylated monophosphorvi lipid A 
is disclosed in European Patent 0 689 454 Bl (SmithKlme Beecham Biologicals SA). 

1 5 Preferably, the panicles of 3D-MPL are small enough to be sterile filtered through a 
0.22micron membrane (European Patent number 0 689 454). 
3D-MPL will be present in the range of lO^g - lOOjig preferably 25-50ug per dose 
wherein the antigen will typically be present in a range 2-50fig per dose. 

20 .Another preferred adjuvant composes QS2 1 , an Hpic punfied non-toxic fraction derived 
from the bark of Quiliaja Saponaria Molma. Optionally this may be admixed with 3 De- 
O-acylated monophosphorvi lipid A (3D-MPL). optionally together with an carrier 

The method of production of QS21 is disclosed in US patent No. 5.057,540. 

25 

Non-reactogenic adjuvant formulations coniaimng QS21 have been described previously 
fWO 96/33739). Such formulations compnsing QS21 and cholesterol have been shown 
to be successful THl stimulating adjuvants when formulated together with an antigen. 

30 Funher adjuvants which are preferential stimulators of THl cell response include 
immunomodulatory oligonucleotides, for example unmethylated CpG sequences as 

disclosed in WO 96/02555. 

15 
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Combinations of different THl stimuiating adjuvants, such as those mentioned 
hereinabove, are also comempiated as providing an adjuvant which is a preferential 
stimulator of THl cell response. For example. QS2I can be formulated together with 3D- 
MPL The ratio of QS:i ; 3 D-MPL will r>-pically be m the order of i : 10 to 10: 1. 
preferaD[> 1 :5 to 5 : 1 and often substantially 1:1. The preferred ranee for opttmat 
svner^> is 2.: 1 to 1 : 1 3D-MPL: QS:i. 

Prererably a carrier is also present in the vaccine composition according to the invention 
The earner may be an oil in water emulsion, or an aiummium salt, such as aluminium 
phosphate or aluminium hydroxide. 

A preferred oii-in-water emulsion composes a metaboiisible oil. such as squalene. alpha 
tocopherol and Tween 80. In a particularly preferred aspect the antigens in the vaccme 
composition according to the invention are combined with QS21 and 3D-.MPL in such an 
emulsion. Additionally the oil in water emulsion may contain span 85 andyor lecithin 
and/or tncaprylin. 

Typically for human administration QS21 and 3D-MPL will be present m a vaccine in the 
range of l^ig - 200^g. such as 10-100|ig. preferably 10^g - 50ug per dose. Typically the 
oil in water will comprise from 2 to 10% squalene. from 2 to 10% alpha tocopherol and 
from 0.3 to 3% tween 80. Preferably the ratio of squalene: alpha tocopherol is equal to 
or less than 1 as this provides a more stable emulsion. Span 85 may also be present at a 
level of 1 %. In some cases it may be advantageous that the vaccines of the present 
invention will further contain a stabiliser. 

Non-toxic oil in water emulsions preferably contain a non-toxic oil. e.g. squalane or 
squalene. an emulsifier, e.g. Tween 80, in an aqueous carrier. The aqueous carrier may 
be. for example, phosphate buffered saline. 

A panicularly potent adjuvant formulation involving QS2L 3D-MPL and tocopherol in 
an oil in water emulsion is described in WO 95/17210. 
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The present invention also provides a polwalent vaccine composition compnsing a vaccine 
tbrmulation of the invention in combination with other antigens, in panicular antiaens usetlii 
for treating cancers, autoimmune diseases and related conditions. Such a polv\ alent vaccine 
composition ma>' include a TH-l inducing adjuvant as hereinbefore descnbed. 

This m\ ention also relates to the use of polynucleotides of the present invention as 
diagnosuc reagents. Detection of a mutated form of the gene charactensed hv the 
polynucleotide of SEQ ID NO: 1 which is associated with a dysfunction wiil provide a 
diagnostic tool that can add to, or detme. a diagnosis of a disease, or susceptibi!ir>- to a 
' n disease, which results from under-expression. over-expression or altered spatial or temporal 
expression of the said gene. Individuals carrying mutations in the gene mav be detected at 
the DNA level by a vanery of techniques. 

Nucleic acids for diagnosis may be obtained from a subject's cells, such as from blood. 

[ 5 unne, saliva, tissue biopsy or autopsy matenal. The genomic DNA may be used directly for 
detection or may be amplified enzymatically by usmg PGR or other amplification techniques 
pnor to analysis. RNA or cDNA may also be used in similar fashion. Deletions and 
inseruons can be detected by a change m size of the amplified product in companson to the 
normal genotype. Point mutations can be identified by hybndizing amplitled DNA to 

20 labeled CASB47 nucleotide sequences. Peifectly matched sequences can be distinguished 
from mismatched duplexes by RN'ase digestion or by differences in melting temperatures. 
DNA sequence differences may also be detected by alterations in electrophoretic mobility of 
DNA fragments m gels, with or without denaturing agents, or by direct DNA sequencing 
(ee. e.g.. Vlyers et a/.. Science (1985) 230: 1242). Sequence chaiiges at specific locations 

25 may also be revealed by nuclease protection assays, such as RNase and S 1 protection or the 
chemical cleavage method (see Cotton et ai. Proc Natl Acad Sci USA f 1985 ) 85; 4397- 
440 1 ). In another embodiment, an array of oligonucleoudes probes compnsing C ASB47 
nucleotide sequence or fragments thereof can be constructed to conduct efficient screemng 
of e.g.. genetic mutations. Array technology methods are well known and have general 

30 applicabihrv- and can be used to address a varierv' of questions in molecular genetics 

including gene expression, genetic linkage, and genetic variability (see for example: Vl.Chee 
et af. Science. Vol 274, pp 610-613 f 1996)). 

1" 
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The diagnosuc assays offer a process for diagnosing or determining a suscepnbilitv to 
cancers, auioimmune disease and related conditions through detection of mutanon in the 
CASB47 nucleotide sequence by the methods described. In addition, such diseases mav be 
diagnosed by methods compnsmg determming from a sample derived from a subieci an 
abnormally decreased or mcreased level of polypeptide or mRNA. Decreased or 
mcreased expression can be measured at the FL\A level using any of the methods well 
knoun in the an for the quantitation of polynucleotides. Assay techniques tliat can be use. 
to determme le\ eis of a protein, such as a polypeptide of the present inv ention, m a sample 
deni ed from a host are well-known to those of skill in the art. 

Thus in another aspect, the present invention relates to a diagnostic kit for performing a 
diagnostic assay which comprises: 

(a) a polynucleotide of the present invention, preferably the nucleotide sequence of SEQ 
ID NO: 1 . or a fragment thereof ; 

(b) a nucleotide sequence complementar\- to that of (a); 

(c ) a polypeptide of the present invention, preferably the polypeptide of SEQ ID N0:2 or 
a fragment thereof; or 

(d) an antibody to a polypeptide of the present invention, preferably to the polypeptide of 
SEQ ID N'0:2. 

The nucleotide sequences of the present invention are also valuable for chromosomal 
localisation. The sequence is specifically targeted to. and can hybridize with, a particular 
location on an mdividual human chromosome. The mapping of relevant sequences to 
chromosomes according to the present invention is an important first step in correlaiinc 
those sequences with gene associated disease. Once a sequence has been mapped to a 
precise chromosomal location, the physical position of the sequence on the chromosome can 
be correlated with genetic map data. Such data are found in. for example. V. McKusick. 
Mendelian Inhentance in Man (available on-line through Johns Hopkins Universit\' Welch 
Medical Librarv ). The relationship between genes and diseases that have been mapped to 
the same chromosomal region are then identified through linkage analysis (coinhentance of 
physically adjacent genes). The differences in the cDNA or genomic sequence between 
affecied and unaffected individuals can also be determined. 
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The polypeptides ot the invention or their fragments or analogs thereof, or cells expressme 
them, can also be used 35 immunogens to produce antibodies immunospecific for 
poly peptides of the present invention. The term '■inununospecific"* means that the antibodies 
have substantially greater affinity for the polypeptides of the invention tfian their affmitv for 
other related polypeptides in the pnor art. 

In a further aspect the invention provides an antibody immunospecific for a poK peptide 
according to the invention or an immunological fragment thereof as hereinbefore defined. 
Preferably the antibody is a monoclonal antibody 

Antibodies generated agamsi polypeptides of the present invention may be obtained bv 
admimstenng the polypeptides or epitope-bearing fragments, analogs or cells to an animal, 
preferably a non-human animal, using routine protocols. For preparation of monoclonal 
antibodies, any techmque which provides antibodies produced by continuous cell line 
cultures can be used. Examples include the hybridoma technique (Kohler, G. and Miistein. 
C. Nature ( 1975) 256:495-497). the trioma technique, the human B-cell hybndoma 
technique (Kozbor et ai.. Immunology Today (1983) 4:72) and the EBV-hybridoma 
technique fCole ei ai. Monoclonal Antibodies and Cancer Therapy, 77-96. Alan R. Liss. 
Inc.. 1985). 

Techniques for the production of single chain antibodies, such as those descnbed in U.S. 
Patent No. 4,946.778. can also be adapted to produce single chain antibodies to polypeptides 
of this invention. Also, transgenic mice, or other orgamsms, including other mammals, may 
be used to express humanized antibodies 

The above-described antibodies may be employed to isolate or to identify clones expressing 

the polypeptide or to purify the polypeptides by affinity chromatography. 

The antibody of the invention may also be employed to prevent or treat cancer, panicularK 

colon cancer, ovanan cancer, lung cancer and brain cancer, autoimmune disease and related 

conditions. 

Another aspect of the invention relates to a method for inducing or modulating an 
immunological response m a mammal which comprises inoculating the mammal with a 

IP 
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polypeptide of the present invention, adequate to produce antibody and or T cei; immune 
response to protect or ameliorate the symptoms or progression ot the disease \'e: 
another aspect of the mvention relates to a method of inducing or modulatms: 
immunological response in a mammal which comprises, delivering a polypeptiae of the 
presenr mvention via a vector directing expression of the poiynucieotide and codmi: for 
the polypeptide vivo m order to mduce such an immunological response to produce 
antibody to protect said animal from diseases. 

It will be appreciated Lhat the present invention therefore provides a method of treating 
abnormal conditions such as. for instance, cancer and autoimmune diseases, in particular, 
colon cancer, ovanan cancer, lung cancer and brain cancer, related to either a presence of ai 
excess of. or an under-express ion of. CASB47 polypeptide activity 

The present invention fiirther provides for a method of screenmg compounds to identity' 
those which stimulate or which uihibit the function of the CASB47 polypeptide. In general, 
agonists or antagonists may be employed for therapeutic and prophylactic purposes for such 
diseases as hereinbefore mentioned. Compounds may be identified from a vanery of 
sources, for example, cells, cell-free preparations, chemical libraries, and namrai product 
mixtures. Such agonists, antagonists or inhibitors so-identified may be natural or modified 
substrates, ligands. receptors, enzymes, etc., as the case may be. of tiie polypeptide: or may 
be strucniral or functional mimetics thereof (see Coligan ef ai. Current Protocols in 
Immunology U2):Chapter 5 (1991)). Screemng methods will be known to those skilled in 
the an. Further screening methods may be found in for example D. Bennett et ai. J Mol 
Recognition. 8:52-58 (1995); and K. Johanson et ai., J Biol Chem. :70(16):9459-947l 
( 1995) and references therein. 

Thus the invention provides a method for screening to identify compounds which stimulate 
or which inhibit the timction of the polypeptide of the invention which compnses a method 
selected from the group consisting of 

(a) measuring the binding of a candidate compound to the polypeptide (or to the cells or 
membranes bearing the polypeptide) or a fusion protein thereof by means of a label 
directly or indirectly associated with the candidate compound: 
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( b I measuring the binding of a candidate compound to the polypeptide i or to the celis or 
membranes bearing the polypeptide) or a fusion protem thereof m the presense of a 
labeled competuior; 

(C) testing whether the candidate compound results in a signal generated by acn\at!on or 
inhibition of the polypeptide, usmg detection systems appropriate to the cells or cell 
memDranes bearmg the polypeptide: 

(d) mixing a candidate compound with a solution containing a polypeptide of ciaim 1 . to 
form a mixture, measuring activity of the polypeptide in the mixture, and comparing the 
activit) of the mixture to a standard; or 
10 le) detecting the effect of a candidate compound on the production of mRNA encodini: 
said polypeptide and said polypeptide in ceils, using for instance, an ELISA assav. 

The polypeptide of the invention may be used to identify membrane bound or soluble 
1 5 receptors, if any. through standard receptor binding techniques known in the art. Well 
known screening methods may also be used to identify agonists and antagonists of the 
polypeptide of the invention which compete with the binding of the polypeptide of the 
invention to its receptors, if any. 

20 Thus, m another aspect, the present invention relates to a screening kit for identifying 
agonists, antagonists, ligands. receptors, substrates, enzymes, etc. for polypeptides of the 
present invention; or compounds which decrease or enhance the production of such 
polypeptides, which comprises: 

(a) a polypeptide of the present invention; 

21 (b) a recombinant cell expressing a polypeptide of the present invention; 

( c I a cell membrane expressing a polypeptide of the present invention; or 
Id) antibody to a polypeptide of the present invention; 
which polypeptide is preferably that of SEQ ID N0:2. 

30 It will be readily appreciated by the skilled anisan that a polypeptide of the present 
invention may also be used in a method for the structure-based design of an agonist, 
antagonist or inhibitor of the polypeptide, by: 

(a) determining in the first instance the three-dimensional structure of the polypeptide: 

21 
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lb} deducing the ihree-dimensionai strucmre for the iikely reactive or bmdinL^ sitei si of 
an agonist, antagonist or inhibitor; 

ic) symhesing candidate compounds that are predicted to bind to or react with the 
deduced binding or reactive sue; and 
: (d) testing whether the candidate compounds are indeed agonists, antasonists or 
inhibitors 

Gene iherap> may also be employed to effect the endogenous production or"CASB4'' 
polypeptide by the relevant cells in the subject. For an overview of gene therap\ . see 
1 0 Chapter 20, Gene Therapy and other Molecular Genetic-based Therapeutic Approaches. 
{ and references cited therein) in Human Molecular Genetics. T Strachan and A P Read, 
BIOS Scientific Publishers Ltd (1996). 

Vaccine preparation is generally described in Pharmaceutical Biotechnology Vol.61 
: 5 Vaccine Design - the subumt and adjuvant approach, edited by Powell and Newman. 

Plenum Press. 1995. New Trends and Developments in Vaccines, edited by Voller et al. 
University Park Press, Baltimore, Maryland. U.S.A. 1978. Encapsulation within 
liposomes is described, for example, by Fullerton. U.S. Patent 4,235.877. Conjugation of 
proteins to macromolecules is disclosed, for example, by Likhite. U.S. Patent 4.372,945 
:o and by .Armor et al.. U.S. Patent 4,474,757. 

The amount of protein in each vaccine dose is selected as an amount which induces an 
immunoprotective response without significant, adverse side effects in typical vaccinees. 
Such amount will vary depending upon which specific immunogcn is employed. 
:5 Generally, it is expected that each dose will comprise I-1000|ig of protein, preferably 
2-lOO.ug, most preferably 4-40^g. An optimal amount for a particular vaccine can be 
ascenained by standard studies involving observation of antibody titres and other 
responses m subjects. Following an inivial vaccination, subjects may receive a boost in 
about 4 weeks. 

"isolated" means altered "by the hand of man" from the natural state. If an "isolated" 
composition or substance occurs in nature, it has been changed or removed from its 
original environment, or both. For example, a polynucleotide or a polypeptide naturalK 
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present m a [ivmg animal is not -isolated." but the same polynucleotide or poi> peptide 
separated from the coexisting matenals of its natural state is "tsoiated"". as the term is 
employed herein 

"Polynucleotide" generally refers to any polyribonucleotide or poiydeoxnbonucieottde. 
which n^a) be unmodified R_\A or DN'A or modified RNA or DNA including smsjie and 
double stranded regions. 

•■\ ariant" refers to a polynucleotide or polypeptide that differs from a reference 
polynucleotide or polypeptide, but retains essential properties. A typical variant of a 
polynucleotide differs in nucleotide sequence from another, reference polynucleotide. 
Changes m the nucleotide sequence of the vanant may or may not alter the ammo acid 
sequence of a polypeptide encoded by the reference polynucleotide. Nucleotide changes 
may result m amino acid substitutions, additions, deletions, fusions and truncations in the 
polypeptide encoded by the reference sequence, as discussed below, A typical variant of 
a polypeptide differs in amino acid sequence from another, reference polypeptide. 
Generally, differences are limited so that the sequences of the reference polypeptide and 
the variant are closely similar overall and, in many regions, identical. A variant and 
reference polypeptide may differ in amino acid sequence by one or more substitutions, 
additions, deletions in any combination. A substituted or inserted amino acid residue 
may or may not be one encoded by the genetic code. A vanant of a polynucleotide or 
polypeptide may be a naairally occurring such as an allelic vanant. or it may be a vanant 
that IS not known to occur naturally. Non-naturally occurring variants of polynucleotides 
and polypeptides may be made by mutagenesis techniques or by direct synthesis. 

"Idenuu ." as known in the an. is a relationship between two or more polypeptide sequences 
or rwo or more polynucleotide sequences, as determined by comparing the sequences. In the 
art, '"identity" also means the degree of sequence relatedness between polypeptide or 
polynucleotide sequences, as the case may be. as determined by the match between 
strings of such sequences. "Identity" and "similarity" can be readily calculated by known 
methods, including but not limited to those described in (Computational Molecular 
Biology. Lesk. .A.M.. ed.. Oxford University Press. New York, 1988: Biocomputmg 
Informatics and Genome Projects, Smith. D.W., ed.. Academic Press. Neu York. 1993. 
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Compuier .-\nalysi5 of Sequence Data. Part I. Griffin. A.M.. and Griffm. H.G.. eds.. 
Humana Press. New Jer5e>-. 1994: Sequence .Analysis in Molecular Biolog> . von Heinie, 
G.. Academic Press. 1987. and Sequence .Analysis Primer. Gribskov. M. and Devereux. 
J . eds.. M Stockton Press. New York. 1991: and CariUo. H.. and Lipman. D . SIAM ,/ 
Applied Main 4.s. 1073 ( 1988f. Preferred methods to determine identitv are desiijned 
gi\ e tne larizesi match between the sequences tested. Methods to determine iden:;i\ and 
simiiariiy are codified in publicly available computer programs. Preferred computer 
program methods to determine identit>- and similarity between two sequences include, ou: 
are not limited to. the GCG program package (Devereux. J., et al.. Nucleic .Acids 
!0 Research llih: 387 (1984)). BLASTP. BL.ASTN'. and F.ASTA (Atschul. S.F. et ai.. J. 
Molec. Biol. :yi 403-410(1990). The BLAST X program is publicly available from 
NCBI and other sources (BLAST Manual, Altschui. S.. ai. NCBI NLM NIH Bethesda. 
MD 20894: AltschuL S., et al , J. Moi. Biol. 215: 403-410 (1990), The well knoun Smith 
Waterman algorithm may also be used to determine identity. 

15 

The preferred algorithm used is FASTA. The preferred parameters for polypeptide or 
polynuieotide sequence comparison using this algorithm include the following: 
Gap Penalty: 12 
Gap extension penalty: 4 
ZO Word size: 2, max 6 

Preferred parameters for polypeptide sequence companson with other methods include 
the following: 

1 ) .Algorithm: Needleman and Wunsch. J. Mol Biol. 48: 443-453 (1970) 
25 Comparison matrix; BLOSSUM62 from Hentikoff and Hentikoff. Proc. Natl. Acad. Sci. 
USA. 89:10915-10919 (1992) 
Gap Penalty: 12 
Gap Length Penalty: 4 

30 .A program useful with these parameters is publicly available as the "gap" program from 
Genetics Computer Group, Madison WI. The aforementioned parameters are the default 
parameters for polypeptide comparisons (along with no penalty for "end aaps) 

24 
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Preterred parameters for polynucleoiide comparison include the tollowing: 
1) Algonthrr.: Needleman and Wunsch, J. Mol Bioi. 48; 443-453 (1970) 
Comparison matrix: matches = -10, mismatch = 0 
Gap Penaity: 50 
_^ Gap Length Penalty: 3 

A program usetlil with these parameters is publicly available as the "gap" program rrom 
Genetics Computer Group. Madison W'l. The aforementioned parameters are ine defaui: 
parameters for polynucleotide comparisons. 

By way of example, a polynucleotide sequence of the present invention may be identical 
to the reference sequence of SEQ ID N0:1. that is be 100% identical, or it may include 
up to a cenain integer number of nucleotide alterations as compared to the reference 
sequence. Such alterations are selected from the group consisting of at least one 
nucleotide deletion, substitution, including transition and transversion. or insertion, and 
wherein said alterations may occur at the 5' or 3' terminal positions of the reference 
nucleotide sequence or anywhere between those terminal positions, interspersed either 
individually among the nucleotides in the reference sequence or in one or more 
contiguous groups within the reference sequence. The number of nucleotide alterations is 
20 determined by multiplying the total number of nucleotides in SEQ ID NO: 1 by the 

numerical percent of the respective percent identity(divided by 100) and subtracting that 
product from said total number of nucleotides in SEQ ID NO; I. or: 



wherein is the number of nucleotide alterations, is the total number of nucleotides 
25 m SEQ ID NO: I , and y is, for instance, 0.70 for 70%, 0.80 for 80%. 0.85 for 85%. 0.90 
for 90%. 0.95 for 95%, etc.. and wherein any non-integer product of and y is rounded 
down to the nearest integer pnor to subtracting it from x^- Alterations of a 
polynucleotide sequence encoding the polypeptide of SEQ ID N0:2 may create nonsense, 
missense or frameshift mutations in this coding sequence and thereby alter the 
30 polypeptide encoded by the polynucleotide following such alterations. 



On < Xn - (Xn • y). 
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Simiiariy. a polypeptide sequence of the present invention may be identical to the 
reference sequence of SEQ ID N0:2, that is be 100% identical, or it may include up to a 
cenain integer number of ammo acid alterations as compared to the reference sequence 
such :nat the ^ o identity is less than 100%. Such alterations are selected from tne group 
consisting of at least one ammo acid deletion, substitution, including conser\ ati\ e and 
non-conser\'ative substitution, or insertion, and wherein said alterations mas occur at the 
amino- or carboxy-terminal positions of the reference polypeptide sequence or anywhere 
betueen those terminal positions, interspersed either individually among the amino acids 
;n the reterence sequence or in one or more contiguous groups wtthin the reference 
sequence. The number of ammo acid alterations for a given % identity is determined b\ 
multiplying the total number of amino acids in SEQ ID N0:2 by the numerical percent of 
the respective percent identity(divided by 100) and then subtracting that product from 
said total number of ammo acids in SEQ ID N0:2. or: 

"a^^a - (^a • y)- 

wherein is the number of amino acid alterations, is the total number of amino acids 
m SEQ ID N0;2, and y is, for instance 0.70 for 70%. 0.80 for 80%, 0.85 for 85*^0 etc., 
and wherein any non-integer product of Xg and y is rounded down to the nearest integer 
prior to subtracting it from x^. 

"Homolog" is a generic tenn used in the an to indicate a polynucleotide or polypeptide 
sequence possessing a high degree of sequence relatedness to a subject sequence. Such 
reiatedness may be qundfied by determining the degree of identity and/or similanry between 
the sequences being compared as hereinbefore described. Falling withm this genenc term 
are the terms "ortholog", meaning a polynucleotide or polypeptide that is the functional 
equivalent of a polynucleotide or polypeptide in another species and "paralog'" meamng a 
functionally similar sequence when considered within the same species. 
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EX.AMPLES 
Example 1 

A database screening method used to select novel genes that are differentiallv 
expressed in cancers 

/./ Introduction 

A compiememar.' approach to experimental antigen discoverv' is to explore the human 
genome databases, particularly those of "Expressed Sequence Tags" (ESTs), in search ot' 
tumour-specific and Oimour-associated antigens. ESTs are small fragments 
(approximately 300 bp) of cDNA made from a collection of mRNA extracted from a 
particular tissue or cell line. Such currently provide a massive amount of ESTs (10'') from 
several hundreds of cDNA tissue libraries, including tumoural tissues from various types 
and states of disease. By means of specifically designed informatics tools, one can search 
in this database for a subset of potential antigen candidates, provided that artifacts are 
carefully avoided. To allow a specific selection, the libraries, from both healthy and 
diseased tissues, have first to be selected on the basis of different quality criteria ( tissue 
quality. library construction method, sequencing depth and quality, diversity index, 
frameshift). The EST sequences from these selected libraries are then compared to 
identify those genes specifically expressed, or significantly overexpressed, in tumoural 
tissues. Currently, the method is limited by the sequencing depth of these libraries, i.e. 
typically only about 10% of all the expressed genes of a particular tissue are represented 
by ESTs from a particular library. This limitation can be overcome by pooling tissue 
libraries. 

After a careful screening using a set of defined critena (novelty of the gene, putative 
expression pattern), the selected candidates can be funher tested for selective expression 
in normal and tumoural tissues, for example by RT-PCR. 

7.2 Method 

The ongmal EST database is reorganized by assembling all the fragments into 
overlapping "genetic clusters". There are several well known algorithms which can be 
used to produce these assemblies. Each resulting assembly is thus a consensus sequence 
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representing a tragment of, or a complete gene. This process reduces the total amount of 
information by one order of magnitude. 

The method allows to select candidates by "customized differential expression" bv 
^ raoRing the number of ESTs by customizable tissue categor> . 
The data are organized m a relational database comprisinc 

- Table "ESTs ': contains at least the EST names or Ids and the code of the cDNA librarv 
from which each EST was generated 

- Table "Assemblies"; contains at least for each assembly the list of EST components 
U) - Table "Libraries": contains at least for each cDNA librar>' its code, tissue or cell line 

type, disease state (normal, tumour or non-tumoural disease) 
Links are made between these tables as shown in Figure I. 

The cDNA libranes are then classified in 5 categones. each subdivided in several 2roups: 

li 

- groups of cell lines (separated normal & cancer cell lines) 

- groups of non-cancerous diseased tissues 

- group of fetal/embryoQic tissues 

- groups of normal tissues 
20 - groups of tumoural tissues 

- I group of unknown origin 

An additional table is then added to the relational database, called "Groups of libraries" as 
shown m Figure 2. 

The next step is the computing, for each assembly, of the number of ESTs ongmatmg 
from each group of libraries (for example using a Sybase Query Language query ): 

- For each assembly 

30 -For each EST 

- Check the corresponding library code 

- Assign the corresponding group code 

- Count the number of ESTs assigned to the same group code 

28 
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- Count the total number of ESTs 



The result of this step is a table called "Results" coniammg one line per assembly, and 
one column per group code, and containmg the final EST counts. 

The table "Results" is used to compute several quantities for each assembly. ■ 

• JL "lumour-to-normat ratio" (TNRi: 

10 (sum of EST_T. EST_F, EST_P. EST_0, EST_Tes. EST_Pl)/total number of ESTs 

where (EST_T) is the sum of all ESTs belonging to the groups that are tissue or cell Ime 
tumours. (EST_F ) is the sum of all ESTs belonging to groups of fetal tissues or cell lines. 
EST_P IS the total number of ESTs from normal prostate tissue. EST_0 is the sum of 
ESTs from normal ovanan tissues, EST_Tes is the total number of ESTs from normal 

1 5 testis tissues or cell lines, and EST_Pl is the total number of ESTs from placenta tissues. 
Note that some normal tissues are included, namely "dispensable" tissues like testis, 
ovarv' and prostate, which may often share expression patterns with tumours ( the so- 
called "cancer-testis antigens" or CT-aniigens). as well as placental, fetal and embryonic 
tissues. 



• Any other sum that is relevant to select candidates for a specific target cancer or type 
of cancers. As an example, one may sum the ESTs from the groups of libraries from 
tumour tissues or cell lines representative of breast tumours and from testis and fetal 
tissues. This may be relevant to detect the above-mentioned CT-antigens. 



The resulting table is called "Customized results", and contains one line per assembly and 
one column per computed sum as well as any other relevant information, such as the total 
number or ESTs. 

30 The "Customized results" table is then sorted according to the desired use. .-\ relevant 

sorting is to use the TN'R column as a primar\' sorting key, and the customized sum as the 
secondary' soning key. 
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Each assembly over a defined threshold dor example: TNR > O.S) is then comparea to j 
sequence database of ioiow-n genes or gene products using any sequence comparison 
algorithm (for example Blast) to screen for novelty of the gene, [n a similar vva> . a 
sequence comparison can be performed usmg the original EST or assembl>- datacase to 
check for alternate splicing variants. 



1.3 Results 

Table 1: EST Distribution 



) ID Librarv' 
NCBi 40518a 
NCBI:5020- 
NCBI 92203T 
NCBI:9329i: 
\CB 1:934 101 
NCB[:934638 
NCBI; 1 0400 1 1 
NCBI: 1122876 
NCB[:II71566 
NCBI: 1 180435 
NCBI: 1073825 
NCBI: 1073930 
NCB1:945746 
NCB1;I43I531 
NCBi:145I557 
NCBI:!462735 
NCBI:1463384 
NCBI.1553"58 
NCBI: 1558729 
NCBI:I756330 
NCBI: 1^57672 
NCB[;i850298 
NCBt:!95062! 
NCBI: (992275 
NCB 1.2020608 
NCB[:2050335 
NCBI:2050343 
NCBI:20731 10 
NCBI:2075055 
NCBI. 2158404 



ID Library* Name 
Scares melanocv-re 2NbHM 
Scares fetai lung NbHLl9W 
Scares NbHTGBC 
Scares ovan. tumor NbHOT 
Scares ovar> tumor NTjHOT 
Scares ovarv- tumor NbHOT 
Scares ovarv tumor NbHOT 
Scares ovary tumor NbHOT 
NCI_CGAP_Co4 
NCI_CGAP_Co5 
Scares ovary tumor NbHOT 
Scares ovarv- tumor NbHOT 
Aorta endothelial ceils. TNF 
2 Scares NFL TGBC SI 
Scares NbHTGBC 
Scares NbHTGBC 
Soarcs NbHTGBC 
Scares NbHTGBC 
Scares NbHTGBC 
2 Scares NFL TGBC SI 
2 Scares NFL TGBC SI 
Scares NhHMPuSl 
Scares fetal lung NbHL!9W 
2 Soares NFL TGBC SI 
Scares fetal liver spleen IN 
Soares placenta 8 to 9 weeks 
Soares placenta 8 to 9 weeks 
6 NCI_CGAP_Lu5 
Soares placenta 8 to 9 weeks 
1 NCI CGAP Bm25 



In summap.'. the EST distribution profile shows expression in ovary tumours, colon 
tumours, lung tumours and brain tumours. 



30 



wo 99/49030 



PCT/EP'*9/0I89: 



1.4 References 

Wan J- S.. Sharp S. J.. Poiner G. M.-C, Wagaman P, C. Chambers J.. Pyati J.. Horn 
L., Galindo J. E.. Huvar A., Peterson P. A.. Jackson M. R.. Eriander M. G. ( l^^^h) Nature 
Biotechnol. 14. 1685. 
^ Pardol! D. M. ( 1P96) Curr. Opm, Immunom. 8. 619. 

Example 2 

Qualitative RT-PCR ampUfication 

Presence of mRNA transcnpts in a panel of normal tissues and a small number or tumour 
samples is evaluated by non-quantitative RT-PCR. 

Total RNA from 19 normal tissues and 3 tumour samples was purchased from 
InVitrogen. mRNA is purified from total RNA after DNAse treatment usmg oiigo-dT 

? magnetic beads (Dynal ). 200 ng of mRNA are reverse transcribed (Expand reverse 
transcriptase, Boehnnger) in a 20 ul reaction and 2 fil of this reaction are amplified by 
PCR (.AmpIiTaq Gold, Perkm-Elmer) for 32 cycles (Perkin-Elmer 9600 thermocycler) 
using standard protocols. Non-template controls (NTC) are always included. 
.Amplification products (10 |il) are visualised on ethidium bromide-stained agarose gels. 

) Oligonucleotides for PCR amplification are designed by computer (LaserGene 

PnmerSeiect module). Specificity of the designed oligonucleotides is evaluated in siiico 
by comparing their sequences to the sequences in the public databases using the FASTA 
algonthm. Transcnpts of the housekeeping GAPDH gene are amplified under identical 
conditions on all tissue samples. GAPDH serves as a positive control and provides a 
visual reference of a highly expressed gene. Detection of CASB47 mRNA in 1 9 normal 
ussues and 3 tumour samples by RT-PCR is shown in Figure 3. 

Example 3 

Real-time RT-PCR analysis 

Reai-time RT-PCR (U. Gibson. 1996. Genome Research: 6.996) is used to compare 
mRN A transcript abundance of the candidate antigen in tumour and normal colon tissues 
from multiple patients. In addition. mRNA levels of the candidate gene are re-evaluated 
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by this approach m a pane: of normal tissues. 



Total RNA is extracted from snap frozen colon tissue biopsies using TriPure reagent 
iBoehringer). Total RNA from normal tissues is from InVitrogen as above. PoK -A* 
mRN.A IS purified from total RNA after DNAase treatment using oligo-dT magnetic 
bea^s I Dynai ). Ouantitlcation of the mR_\A is performed by spectrotluorimetn i BioRad 
using Sybrll dye (Molecular Probes). Primers for amplification are designed uirh the 
Perkin-Eimer Primer Express software using default options for TaqMan amplifcation 
conditions. 



Real-time reactions are assembled according 10 standard PCR protocols usinn 2 of 
purified mRNA for each reaction. SybrI dye (Molecular Probes) is added at a final 
dilution of 1 75000 for real-time detection. Amplification (40 cycles) and real-time 
detection is performed in a PE 7700 system. Ct values are calculated using the 7700 
Sequence Detector software for the tumour (Ctl) and normal (CtN) samples of each 
patient. The difference between Ct values (CtN-CtT) is a direct measure of the difference 
in iranscnpt levels between the t^imour and normal tissues. As Ct values are log-linearly 
related to copy number and that the efficiency of PCR amphfication under the prevailing 
experimental conditions is close to the theoretical amplification efficiency. 2 is an 

estimate of the relative transcript levels in the two tissues (i.e. fold mRNA over- 
expression in tumor). The percentage of over-expressmg patients and the average level of 
mRN'A over-expression in the tumours of these patients is calculated from the data set of 
1 8 patients. In addition, Ct values obtained with 12 normal tissues are provided for the 
candidate antigen and beta-actm. 



Table 2: 



Patients over-expressing CASB4T 
in colon tumours 

(%) 



Average level of over-expression 
in colon tumours 
(fold) 



Table 3: Real-time RT-PCR Ct vai ues for CASB47 and actin in 12 normal tissues. 
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Le»;end. Bla: bladder. Bra: brain, Bre: breast, Cer: cer\-i\. Hea: hean. Kid: kidne> . Li\ ■ 
ii\er. iun: iung. Oes: oesophagus. Pla: placenta, Rec: rectum. Ute: uterus. 



Example 4 

Identification of the full length cDNA sequence 

Colon tumour cD\A libraries are constructed using the Lambda Zap [I system 
(Stratagene) from 5 |ig of polyA* mRNA. The supplied protocol is followed except that 
Superscriptll (Life Technologies) is used for the reverse transcription step. Oliao dT- 
pnmed and random-pnmed libraries are constructed. About 1.5 xlO^ independent phace 
are plated for each screening of the library. Phage plaques are transferred onto nylon 
filters and hybridised using a cDNA probe labelled with AlkPhos Direct. Positi\e phage 
are detected by chemilummescence. Positive phage are excised from the agar plat, eiuted 
in 500ul SM buffer and confirmed by gene-specific PCR. Eiuted phage are converted to 
single strand Ml 3 bacteriophage by in vtvo excision. The bacteriophage is then converted 
to double strand plasmid DNA by infection of £ coli. Infected bacteria are plated and 
submitted to a second round of screening with the cDNA probe. Plasmid DNA is purified 
from positive bacterial clones and sequenced on both strands. 

When the full length gene cannot be obtained directly from the cDNA librar> . missing 
sequence is isolated using RACE technology (Marathon Kit, ClonTech.). This approach 
reiies on reverse transcribing mRN'A into double strand cDNA. ligating linkers onto the 
ends of the cDNA and amplifying the desired extremity of the cDNA using a gene- 
specific primer and one of the linker oligonucleotides. Marathon PCR products are cloned 
into a plasmid (pCRII-TOPO, InVitrogen) and sequenced. 



E.xample 5: 
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5.1 Expression and purification of tumour-specific antigens 



Expression m microbial hos:s rs used to produce me antigen of the invention ror \ accine 
purposes and :o produce protein fragments or whole protein for rapid purificauon and 
generation of antioodies needed for characterization of the naturally expressed protein b\ 
[mmunohistochemistry' or for rbllou-up of purification. 

Recombinant proteins may be expressed in two microbial hosts. E. coli ana :n > east 
(such as Saccharomyces cerevisiae or Pichia pastons). This allows the selection of the 
expression 5> stem with the best features for this panicular anugen production. In 
general, the recom.binant antigen will be expressed in E. coii and the reagent protein 
expressed in yeast. 

The expression strategy first involves the design of the primary structure of the 
recombinant antigen. In general an expression fusion partner (EFP) is placed at the N 
terminal extremity to improve levels of expression that could also include a region useful 
for modulating the immunogenic properties of the antigen, an inamune fusion partner 
(If?). In addition, an affinity fusion partner (AFP ) useful for facilitating further 
purification is included at the C-terminal end. 

VVhen the recombinant strains are available, the recombmant product is characterized bv 
the evaluation of the level of expression and the prediction of funher solubility of the 
protein by analysis of the behavior in the crude e.xtract. 

After growth on appropnate culture medium and induction of the recombinant protein 
expression, total extracts are analyzed by SDS-PAGE. The recombinant proteins are 
visualized in stained gels and identified by Western blot analysis using specific 
antibodies. 

A comparative evaluation of the different versions of the expressed antigen w ill allow the 
selection of the most promising candidate that is to be used for funher purification and 
immunological evaluation. 

The purification work follows a classical approach based on the presence of an His 
affinit) tail in the recombinant protein. In a typical experiment the disrupted cells are 
filtered and the acellular extracts loaded onto an Ion Metal Affinity Chromatography 
(IMAC. Ni~NTA from Qiagen) that will specifically retain the recombinant protein. 
The retained proteins are eiuted by 0-500 miM Imidazole gradient (possibly in presence of 
a detergent) in a phosphate buffer. This step is optimally followed by an Anion Exchange 
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resin step and a Size Exclusion chromatography step depending on the success of the 
Imac step and the nature of the coniaminants. 

5.2 Antibody production and immunohistochemistry 

Small amounts of reiatively punned protein can be used to generate immunoioiiica! 
tools in order to 

a) detect the expression by immunohistochemistry in normal or cancer tissue sections, 
hi detect the expression, and to follow the protein during the purification process 
(ELISA Western Blot): or 

ci characterise/ quantify- the purified protein (ELISA). 

5.2.1 Polyclonal antibodies: 
Immunization 

2' 3 Rabbits are immunized . imramusculariy (I.M.) , 3 times at 3 weeks internals with 
lOOug of protein, formulated in the adjuvant 3D-MPL/QS21 . 3 weeks after each 
immunisation a blood sample is taken and the antibody titer estimated in the serum by 
ELISA using the protein as coating antigen following a standard protocol. 

ELISA 

96 well micropiaies (maxisorb Nunc) are coated with 5(ig of protein overnight at 4°C. 
After Ihour saturation at 37°C with PBS NCS 1%, senal dilution of the rabbit sera is 
added for IH 30 at 37°C {starting at 1/10). After 3 washings in PBS Tween, ami rabbit 
biotinylated anti serum (Amersham ) is added (1/5000). Plates are washed and 
peroxydase coupled streptavidin (1/5000) is added for 30 min at 37='C. After washing. 
50|il TMB (BioRad) is added for 7 mm and the reaction then stopped with H:S04 0.2M. 
The OD can be measured at 450 rnn and midpoint dilutions calculated by SoftmaxPro. 

5.2.2 Monoclonal antibodies: 
Immunization 

5 BALB/c mice are immunized 3 times at 3 week intervals with 5 |ag of punfied protein. 
Bleedings are performed 14 days post II and 1 week post 3. The sera is tested by Elisa 
on purified protein used as coated antigen. Based on these results (midpoint dilution > 
10000 ) one mouse is selected for fusion 



w o 99 49030 



PCT/EP99 01892 



Fusion,^ K4 Tse lection 

Spleen cells are fused with the SP2. 0 myeloma accordmg to a standard protocol usmc 
PEG JO'^o and DMSO 5%. Celis are then seeded m 96 well plates 2.5 xlO' - liV 
5 cells u eli and resistant clones will be selected m HAT medium. The supernatant of these 
h> bridomas will be tested tor their content m specific antibodies and when positn e. 
will be submitted to 2 cycles of limited dilution . After Z rounds of screenmi:. 3 
h> bndomas will be chosen for ascitis production. 

\n 5.2.3 Immunohistochemistrv 

When antibodies are available, immuno staining is performed on normal or cancer 
tissue sections, in order to determine : 

• the level of expression of the protein antigen of the invention in cancer relative to 
normal tissue or 

! 5 "the proportion of cancers of a certain type expressing the antigen 

• if other cancer types also express the antigen 

• the proportion of ceils expressing the antigen in a cancer tissue 

• the cellular localisation of the antigen 

20 Tissue sample preparation 

.After dissection, the tissue sample is mounted on a cork disk in OCT compound and 
rapidly frozen in isopentane previously supercooled in liquid nitrogen (-l60Xi. The 
block will then be conserved at -70°C until use. 7- 1 0.um sections will be realized m a 
cryostai chamber (-20, -30°C). 

25 

Siaimn^ 

Tissue sections are dried for 5 min at room Temperature (RT). fixed m acetone for 
lOmin at RT.dned again, and saairated with PBS 0.5% BSA 5% serum. After 30 min at 
RT either a direct or indirect staining is performed using antigen specific antibodies. A 
30 direct staining leads to a better specificity but a less intense staining whilst an indirect 
staining leads to a more intense but less specific staining. 
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5.3 Analysis of human cellular rmnaune responses to the antigen of the invention 

The immunological relevance of the antigen of the invention can be assessed b> in viiro 
priming of human T cells. All T cell lymphocyte Imes and dendritic ceils are derived 
from PBVICs (peripheral blood mononuclear cells) of healthy donors (preferred HLA-a: 
^ subtype t. An HLA-A2. 1 'K^ transgenic mice is also used for screening of HLA-A2.! 
peptides. 

Ne\v|> discovered antigen-specitic CDS- T cell lines are raised and maintained hN 
vveekiv m vuro stimulation. The lytic activity and the y-IF\ production of the CD8 line^ 
10 in response to the antigen or antigen derived-peptides is tested using standard procedures. 

Two strategies to raise the CD8+ T cell lines are used: a peptide-based approach and a 
whole gene-based approach. Both approaches require the full-length cDNA of the newl> 
discovered antigen in the correct reading frame to be either cloned in an appropriate 
15 deliver>' system or to be used to predict the sequence of HLA binding peptides. 

Peptide-based approach 

The HLA-A2 binding peptide sequences are predicted by the Parker's algorithm. Peptides 
are then screened in the HLA-A2.I/K^ transgenic mice model (Viiiello et al.}. Briefly. 
20 transgenic mice are immunized with adjuvanted HLA-A2 peptides, those unable to 
induce a CDS response fas defined by an efficient lysis of peptide-pulsed autologous 
spleen cells) will be further analyzed in the human system. 

Human dendritic cells (cultured according to Romani et al.) will be pulsed with peptides 
and used to stimulated CD8-soned T cells (by Facs). After several weekly stimulations. 
Z5 the CDS lines will be first tested on peptide-pulsed autologous BLCL (EBV-B 

transformed cell lines). To venty the proper in vivo processing of the peptide, the CDS 
lines will be tested on cDN'A-transfected rumour cells (HLA-A2 transfected LnCaP. 
Skov3 or CAMA rumour cells). 

30 Whole gene-based approach 

CDS- T cell lines will be primed and stimulated with either gene-gun transfected 
dendritic cells, retrovirally transduced B 7. 1 -transfected fibroblastes. recombinant pox 
virus (Kim et al. ) or adenovirus (Burterfield et al.) infected dentridic cells. Virus infected 
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cells are ver.' efficient to present aniigemc peptides since the antigen is expressed at hish 
level but can only be used once to avoid the over-growth of viral T cells lines 

After alternated stimulations, the CD8 lines are tested on cDNA-transfected tumour cells 
as indicated above Peptide specificity and identit>' is determined to confirm the 

;mmunoloeical \aiidation. 
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Claims 



20 



1 . An isolated polypeptide comprising an amino acid sequence which has at least 70% 
identirv to the amino acid sequence of SEQ ID N0:2 over the entire length of ot SEQ ID 



2. .\n isolated polypeptide as claimed in claim 1 in which the amino acid sequence has at 
leas: ^^^5% identity. 

3. The polypeptide as claimed m claim 1 comprising the amino acid sequence of SEQ ID 



4. The isolated polypeptide of SEQ ID N0:2. 

5. A polypeptide comprismg an immunogenic fragment of a polypeptide as claimed in 
any one of claims 1 to 4 in which the immunogenic activity of the immunogenic fragment 
is substantially the same as the polypeptide of SEQ ID N0;2 

6. .An isolated polynucleotide comprising a nucleotide sequence encoding a polypeptide that 
has at least 70% identity to the ammo acid sequence of SEQ ID N0:2. over the entire length 
of SEQ ID N0;2; or a nucleotide sequence complementary to said isolated polynucleotide. 

7. .An isolated polynucleotide comprismg a nucleotide sequence that has at least 70% 
identit}' to a nucleotide sequence encoding a polypeptide of SEQ ID N0:2. over the entire 
coding region; or a nucleotide sequence complementary to said isolated polynucleotide. 

8. .An isolated polynucleotide which compnses a nucleotide sequence which has at least 
70°/o identity to that of SEQ ID NO; I over the entire length of SEQ ID NO: I ; or a 
nucleotide sequence complementar>' to said isolated polynucleotide. 

9. The isolated polynucleotide as defined in any one of claims 6 to 8 in which the 
identity is at least 95%. 



N0:2. 



N0:2 
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iO. An isolated poiynucleoude seiecied trom: 

(a) a polynucleotide compnsmg a nucleotide sequence encoding the polypeptide or'SEQ 
ID SO:Z: 

( b I the polynucleotide of SEQ ID NO: 1 ; and 
:= I CI a polynucleotide obtainable by screening an appropnate library under stnnceni 
hybridization conditions vMth a labeled probe having the sequence of SEQ ID NO: i or a 
rragmem thereof; 

or a nucleotide sequence complementary to said isolated poiv nucleotide 

iO 11. .\n expression vector or a recombinant live microorganism comprising an isolated 
poU nucleotide according to any one of claims 6-10. 

12. A host cell composing the expression vector of claim 1 1 or a membrane thereof 
expressing the polypeptide of claim 1. 

15 

13, A process for producmg a poiypeptide of claim 1 comprising culmnng a host cell of 
claim 12 under conditions sufficient for the production of said polypeptide and recovermu 
the polypeptide from the culture medium. 

20 14. A vaccine comprising an effective amount of the polypeptide of any one of claims 1 
to 5 and a pharmaceutical ly acceptable carrier. 

15. A vaccine comprising an effective amount of the polynucleotide of any one of claims 
6 to 10 and a pharmaceutically effective carrier. 

25 

16. A vaccine comprising an effective amount of antigen presenting cells, modified by m 
vitro loading with a polypeptide of any one of claims 1 to 5, or genetically modified in 
vitro to express a polypeptide of claim I and a pharmaceutically effective carrier 

30 1 7. .A vaccine as claimed in any one of claims 14 to 16 which additionally comprises a 
TH-1 inducing adjuvant. 
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18. A vaccine as claimed in claim 1 7 m which the TH- 1 inducing adjuvani is selected 
from the group of adjuvanis compnsmg: 3D-MPL, QS2 1 . a mixture of QS2 1 and 
choiesterol. and a CpG oligonucleotide. 

1*5. An antibody immunospecific for the polypeptide or immunoiogica! fragmen: as 
claimed in an> one of claims I to 5. 

20. A method for screening to identifv compounds which stimulate or whicn inhibit the 
function of the polypeptide of any one of claims I to 5 which comprises a method selectea 
trom the group consisting of: 

(a) measuring the binding of a candidate compound to the said polypeptide (or to the ceils 
or membranes bearing the polypeptide) or a fusion protein thereof by means of a label 
directly or indirectly associated with the candidate compound; 

( b) measuring the binding of a candidate compound to the said polypeptide { or to the 
cells or membranes bearing the polypeptide) or a fusion protein thereof in the presense of 
a labeled competitior; 

(c) testing vvhether the candidate compound results in a signal generated by acti\'ation or 
inhibition of the said polypeptide, using detection systems appropriate to the cells or cell 
membranes beanng the polypeptide; 

f d) mixing a candidate compound with a solution contaimng a polypeptide of any one of 
claims I to 5. to form a mixture, measuring activity of the polypeptide in the mixture, and 
comparing the activity of the mixture to a standard; or 

(e) detecting the effect of a candidate compound on the production of mRN A encoding 
said polypeptide and said polypeptide in cells, using for instance, an ELISA assay, 

21 . A method for the treatment of a subject by immunoprophyiaxis or therapy 
composing m vitro induction of immune responses to a molecule of any one of claims 1 
to 5. using m vitro incubation of the polypeptide of any one of claims I to 5 or the 
polynucleotide of any one of claims 6 to 10 with cells from the immune svstem of a 
mammal, and reinfusing these activated immune cells to the mammal for the treatment of 
disease. 
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22. A method as ciaimed in ciaim 21 wherein the treatment is for ovanan. coioa, iunz or 
brain cancer. 

23 \n agonist or antagonist to the pol\'peptide of claims 1 to 5. 

24 A compound which is: 

(a) an agonist or antagonist to the polypeptide of claims I to 5; 

(b) isolated polynucleotide of claims 6 to 10: or 

(c) a nucleic acid molecule that modulates the expression of the nucleotide sequence 
10 encoding the polypeptide of any one of claims 1 to 5; 

for use m therap\ . 

25. A process for diagnosing a disease or a susceptibility to a disease in a subject related 
to expression or activity of the polypeptide of any one of claims 1 to 5 in a subject 

!5 comprising: 

(a) determining the presence or absence of a mutation in the nucleotide sequence 
encoding said polypeptide in the genome of said subject; and/or 

(b) analyzing for the presence or amount of said polypeptide expression in a sample 
derived from said subject. 

20 

26. An isolated polynucleotide selected form the group consisting of: 

(a) an isolated polynucleotide compnsing a nucleotide sequence which has at least 70% 
identity to SEQ ID N0:3 over the entire length of SEQ ID N0:3; 

(b) an isolated polynucleotide compnsing the polynucleotide of SEQ ID N0:3; 
25 ( c) the polynucleotide of SEQ ID N0:3; or 

(d) an isolated polynucleotide comprising a nucleotide sequence encoding a polypeptide 
which has at least 70% identity to the ammo acid sequence of SEQ ID NO:4, over the entire 
length of SEQ IDN0:4. 

30 27. A polypeptide selected tTom the group consisting of: 

(a) a polypeptide which comprises an amino acid sequence which has at least 70% 
identity to that of SEQ ID N0:4 over the entire length of SEQ ID N0:4; 
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(b) a polypeptide in which the ammo acid sequence has at least 70% identiiy to the ammo 
acid sequence of SEQ ID N0:4 over the entire length of SEQ ID N0:4; 

( c ) a polypeptide which compnses the amino acid of SEQ ID N0:4; 

(d) a pol>'peptide which is the polypeptide of SEQ ID N0:4; or 

(c) a poKpeptide which is encoded by a polynucleotide comprising the sequence 
contained in SEQ ID M0:3. 

28. A live vaccine composition compnsmg an expression vector or recombinant live 
micro-organism according to claim I 1. 
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Figure 1 
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Figure 2 
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Figure 3: Detection of CASB47 mRNA in 19 normal tissues and 3 tumour 
samples by RT-PCR. 

Lanes from left to nght are (normal tissue: N; aimour tissue: T): breast(>;), breast (T). 
colon On), colon (T). lung (S), lung (T), brain (N), hean (N), kidney (N). oesophagus 

NIC. bladder fM), cervix fN). muscle (N). pancreas (N), placenta fN). recmm 
(N). skin (S), spleen (N), stomach fN). testis ps'), uterus (N), NTC. Transcnpts are 
delected ai very low levels m all tissues tested. 
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c-gaccggcg 


crgcgggggc 


cgacacgtcg 


ccccggcctg 


ggcgcgagga 


cgtgggggct 


? C 


gcgggcgcgc 


ggcggccgcg 


cgagccgccg 


gagcaggagc 


tgcagcgacg 


" cgggagcag 


12C 


aagcgg-ggc 


gacacgacgc 


gcagcagctg 


cagcagccca 


agcacctgga 


gtcct tr tac 


13 0 


gaaaaacct:- 


c"crt;gggct 


tatcaaggaa 


gacgagacca 


agccagaaga 


t cgcar acca 


24 0 


gatczaccag 


gcaatgaaca 


cgccagggaa 


t r tctggccc 


atgcaccaac 


^aaaggact t 


2QC 


-ggacgzcac 


cggggaaaga 


agCcaaagt t 


acgcagtgtt 


ggcgtcgcaa 


acgccatggt 


3 6 2 


caccgaacgg 


gtgacaaaga 


acgccctttc 


tcratcaaag 


gcaaccaaaa 


gt^agagcag 


42C 


1 - ragagtgg 


cacargaaga 


- cccaigcat: 


gacarcacac 


gagacaacaa 


acgacacgaa 


-i 5 2 


aagga -gcaa 


ggacacagca 


grtaaaacag 


ccactggagg 


atitctacctc 


agacgaagac 




aggagcagct 


ccagttcctc 


tgaaggtaaa 


gagaaacaca 


agaaaaagaa 


gaagaaagaa 


6C0 


aagcataaga 


aaaggaagaa 


agaaaagaaa 


aagaagaaaa 


aacggaagca 


caaatcttcc 


66 0 


aagt caaatg 


agggttctga 


c-cagagcga 


caaggatg-g 


act tgctcaa 


cat tctct tc 


^2 C 


ccaaacactg 


accaaggaac 


agaggaagar 


gcagtcagag 


aaagcagcag 


gatagagacg 


76 0 


3cgagagagg 


agrata-gtig 


ggtcacagca 


g:gagc-ccc 


acccgcctrg 


-agcgaagat 


842 


grgaccccag 


gagagggagc 




aggcgctagc 


cctggacagc 


agccgacrtc 


90C 


aggcaggaaa 


gtt zcttcat 


cgccgccctc 


-crgc-gg-c 


aca.gagtitc 


acgatccct: 


9GC 


c gaagtgc 3t 


cccacagggt 


ggcaggactg 


ggagaac ct c 


cgaggcg::gt 


etc rcaggcc 


1 C 2 2 


ct ccracagc 


ttgtgccct " 


cacagtgtgg 


actcaggt rc 


cacagaca'ic 


aggcrggagc 


1280 




gt 
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sac ier.£ 



iei- Se- Ar? Pr- 31y Ar- "l. Asc Val 31y Ala Ala Gly Ala. Arg 
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30 



^.la rr: .-.--^ .r: -r: Z-',.i Leu 3-n Arg Arg Ar^ Zl^ 31n 

;c 

_V-s ;.r= Arr Arr His A-p Ala 3." 3in :.eu 31- Leu Lys His Le- 

45 

- - ^ = - -'-■^ - - - 5 ? ? rz ? r !■ 



;:y Leu :.e lys Zl^ Asp SI; 



Tnr ^y^ Pr- 31 ■ Asc Cys Pre Asp Va: ?rc Gly Asn Glu His Ala 

- - " 3C 

Arq 3... Fr.-e le^ Ali His Ala Pre Thr Lys Gly Leu Trp Met ?rz Le- 

55 30 ^5 

31y Ly3 Gl^ 7^. i^ys Val Met Glr. G'y£ Trp Arg Cys Lys Arg Tyr Glv 

i:: 1C5 

His Arg Thr Gly Asp Lys Glu Cys Pro Phe Pne He Lys Gly Asn Glr. 

11= 120 125 

Lys Leu Glu Gin Phe Arg Val Ala H±s Glu Asp Pre Met Tyr Asp He 

13C 135 14C 

lie Arg Asp Asr. Lys Arc His Glu l^ys Asp Val Arg He Gin Glr. Leu 
i-i- iSC 155 16C 

Ly3 Gl- Leu Leu Glu Asp Ser Thr Ser Asp Glu Asp Arg Ser Ser Ser 

1€5 170 1^= 

Ser Ser Ser Gl- 3-y Lys Glu Lys His lys Lys Lys Lys Lys Lys Glu 

la: 19^ ^5^, 

Lys Hi- lys lys Arg lys Lys Gl„ Lys lys lys lys Lys lys Arg Lys 

-55 20C 205 

His Lys Ser Ser _ys Ser Asn Glu Gly Ser Asp Ser Glu 
^-C :i5 22C 



<210> 2 

^:i:> 505 

cZi2^ DNA 

<212> r^oTc sapiens 



c 4 C 0 3 

grggc-gcgc gaccgccgga ccaggagctg -aacgacgtr gggagcagaa gcgcggcgac SG 

acgaccgcag accigcacca gr-.zaagcac zcggagtcct tttacgaaaa azzzzzzzcn L2Z 

gggz--a':ra aggaagacga gac^aaqcca gaagatrgca taccagatgt accaggcaat 13: 
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gggaat r tzz 


ggct ra ;c:ca 


zcaaciaaag 


gaccc ::ggar 


gccaccgggg 


24C 


aaagaagt ca 


aagt 1 atgca 


gtg; r^ggcg- 


cgcaaacgc:: 


acggtcaccg 


aacgggcgac 


i Ij C 


aaagaazgcz 


"tttc-tra: 


zaaaggcaac 


caaaagrtag 


agcagt-cag 


agtggcacat 


3SC 


aaaqazccca 


tgca-gacat 


caiacgagac 


aacaaacgac 


a- gaaaagga 


cgtaaggata 


42C 


cagzagt zaa 


aacag:: r a r " 


ggacgait-i 


acctcagatg 


aagacaggag 


catccagtcc 


480 






"gaa r 








SO 5 
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23 



< 4 0 3 3 4 

r-ro ?rD Glu Gin Glu Leu 31n Arg Arg Arg Giu Gin Lys Arg Gly Asp 

Tnr Thr Ala Glu Leu Gin Gin Leu Lys His Leu Glu Ser Phe Tyr Glu 

2C 25 30 

Lys Pre Pro Pre Gly „eu He Lys Glu Asp Glu Thr Lys Pro Glu Asp 

3= 4C 45 

Cys lie Pro Asp Val Fro Gly Asn Glu His Ala Arg Glu Phe Leu Ala 

55 60 
His A.a Pre Thr Lys Gly Leu Trp Mer Pro Leu Gly Lys Glu Val Lys 

7C 75 80 

Val Met: Gin Cys Trp Arc Cys Lys Arg Tyr Gly His Arg Thr Gly Asp 

35 9C 95 

Lys Glu Cys Pro Phe Phe He Lys Gly Asn Gin Lys Leu Glu Gin Phe 

10- 105 110 

Arg Val Ala His Glu Asp Pro Met Tyr Asp lis He Arg Asp Asn Lys 

H5 120 125 

Arg His Glu Lys Asp Val Arg He Gin Gin Leu Lys Gin Leu Leu Glu 

-30 135 140 

Asp Ser Thr Ser Asp Glu Asp Arg Ser He Gin Ser Leu Ser Arg Ser 
150 155 16C 

Arg Arg Ala Glu 
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SEQUENCE INFORMATION 



SEQ ID NO:l 

CCGAGCGGCGCTGCGGGGGCTGACatgtcgtcccggccigggcscaacsaciniiiZLiiziicicc'-onc^'c 

gcggcggccgcgigagccgccggagcaggagctgcagcgacgtcggVagcagaa^ceccaoc^acac^^^ 

tgcagcagcicaagcacctggagtcctmacgaaaaaccicctcctgggcttatcaaggaaeatoVaac 

aLX-agatgtaccaggcaatgaacacgccagggaamctggcicatgcaccaactaaa^t!izaani;uat<!ccaci^ 

tcaaaguatgcagtgnggcgngcaaacgctatggtcaccgaacgggigacaaaaaa^teccctncmica 

gtiagagcagncagagtggcacaigaagatcccatgiatgacaicatacgagacaataaacLjacatLiaaaaooaVaiaai:^'^ 

gcagnaaaacagnaciggagganctacctcagatgaagataggagcagciccagnccictgaaggtaaaiaLia^acaca^ 

aaagaagaagaaagaiiaagcataagaaaaggaagaaagaaaagaaaaagaauaaaa 

aacggaagcacaaatcaccaagTcaaatgagggTtciaacica2aLn2aC.A.AGGATGTGACTTGTTC-\-\CAT 

TCTCTTCTC AAAC ACTGACC.AAGG AACAGAGG.A.AGATGCAGTC AG AG A A^AGC A 

GC AGGATAG AGACGCCG AGAGAGGAGTATATGTGGGTCAC AGC AGTGAGCTC 

CCACCCGCCTTGCAGTGAAGATGTGACCCCAGGAGAGGGAGTGTCTCCTTCCA 

GGTGCTAGCTCTGGACAGCAGCTGATTTTAGGCAGG.A.A.AGTTTCTTCATCGTTG 

T CCTCCC TGCTGGTC ACATGAGTTVACGATTCCTTTGAAGTGTCTCCC AC AGGG 

TGGC AGGACTGGGAGAATCTCTGAGGCGTGTCTTCCAGGCCCTCCC AC AGCTT 

GTGCCCTCCACAGTGTGGACTCAGGTCCCATAGACATCAGGCTGGAGTCTTCTC 
TGTTGT 

SEQ ID NO:2 

MSSRPGREDVGAAGARRPREPPEOELQRRREQKRRRHDAQQLQQLKHLESFYEKP 
PPGLIKEDETKPEDCIPDVPGNEHAREFLAHAPTKGL\V7vlPLGKEVKVVIQCWRCKR 
VGHRTGDKLECPFFIKGNQKLEQFRV.AHEDPMYDIIRDNICRHEKDVRJOOLKOLLE 
DSTSDEDRSSSSSSEGKEKHKKK^KKEKHKKRKKEKJC^^ 

SEQ ID NO:3 

GCGGCCGCGTGACCGCCGGAGC AGGAGCTGC AGCGACGTCGGG AGC AGAAG 

CGCGGCGACACGACCGCAGAGCTGCAGCAGCTCAAGCACCTGGAGTCCTTTl 

ACGAAAAACCTCCTCCTGGGCTTATCAAGGAAGATGAGACTAAGCCAGAAGA 

TTGCATACCAGATGTACCAGGCAATGAACACGCCAGGGAATTTCTGGCTCAT 

GCACCAACTAAAGGACTTTGGATGCCACTGGGGAAAGAAGTCAAAGTTATGC 

AGTGTTGGCGTTGCAAACgCTATGGTCACCGAACGGGTGACAAAGAATGCCC 

TTTCTTTATCAAAGGCAACC,A.AAAGTTAGAGC AGTTCAGAGTGGC ACATGA A 

GATCCCATGTATGACATCATACGAGACAATAAACGACATGAAAAGGACGTAA 

GGATACAGCAGTT.\.\AACAGTTACTGGAGGATTCTACCTCAGATGAAGATAG 

GAGCATCCAGTCCCTGAGTAGGTCCCGTCGTGCCGAAC 



SEQ ID iNO:4 

PPEOELQRRREQKRGDTTAELQQLKHLESFYEKPPPGLIKEDETKPEDCIPDVPGN 
EHAREFLAHAPTKGLWMPLGKEVKVMOCWRCKRYGHRTGDKECPFFIKGNQK 
LEQFRVAHEDPMYDIIRDNKRHEKDVRIQQLKOLLEDSTSDEDRSIOSLSRSRR.AE 
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Box I Observations where certain ctajms were found unsearchable (Continuation of item 1 of first sheet) 



'^is tnternaiirrai Sear:r ^epon -as net ceen es;aDii3nea ;n ^eso^c: ot ceaam ciaims uncJer Amc:e i :"2\a) 'cr tre tcilOAing reasons- 
X " la rs Nos 

::ecai-se tnev reia[r to sjbiect rnarrer nci r^Quirea 'o sear.-neaov ir:s Author^/ nareiv 

Remark: Although claimfs) 21, 22 

is(are) directed to a metnod of treatment of the human/am ma] 
body, the search has been carriea out and based on the alleged 
effects of the compound/composT ti on , 

2 _X_ C airns NOS 23 . 24 

□ecajse they --elate to parrs ot the international Aooncation tnai do notccmptv with the Drescnbea requirements :o sjcn 
jn ejaent that rc -^eaninqtui Internationa) Searcri can oe carnea Out, speciticaiiv 



5ee FURTHER INFORMATION sheet PCT/ISA/2I0 



3 Claims Nos.. 

Decause they are aeoenoent ciairrs ana are rot drattea n accordance with tne secora and thira sentences ot Pule 6 4(a) 



Sox II Observations where unity of invention is lacking {Continuation of item 2 ot first sheet) 



This nre'-r'a:;ona; Searcninq Ajtnorrn/ fcura multiple if^venttors tn tnis mterr^attonai applicat:or as follows 



As aii reouirea aiaitiona! searcn tees were fmeiv pato ov tne aoDHcar: ;hts I'^ternationa; Search Pecon covers ail 
•iearcnaoie ciaims 



J ail searcnao e c;aims couid be searched *:rout eftorr |Ls;l^lng an additional tee :his Authort^v a;d not mvite cavmeni 
;f arv additional 'ee 



As oniv some of rne reouired additional search tees A-ere nrreiy patd bv tne acpiicant I'-is internanonat Search Report 
covers oPty those ciaims for .vmch fees were caid. specifjcaiiv ciaims \g<; 



No required additional search fees were -irreiv naio bv tne aoplicar^; C ■''iSMq<.e(-'iv 'ms rternationai Searcn Pepon is 
restricted to tne invention drst meniioned ir -he claims: it is cove'-eo bv "i r^*; '-■■s 
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lort'nuati an of Box 1 . 2 
: : a^ms No5 . . 23 . 24 



:)aicl cUims relate to antagonists or agonists without giving a true 
tec.imcal characterization o-' the claimed matter. In conseauence. the 
scooe of saia claims 1s ambiguous and. moreover, their 5 uc ject-matte^ - 
/ague ana not sufficiently cnsclosed. 

^he applicant's attention is drawn to the fact that claims relating to 
inventions in respect cf which no international search report has been 
estaolisnea need not te the suoject of an international preliminary 
examination (Rule oo.U'e) PCT}. The applicant is advised that the EPO 
col icy wnen acting as an International Preliminary Examining Authority i 
normally not to carry out a preliminary examination on matter which has 
not Deen searched, 'his is tne case irrespective of whether or not the 
claims are amended following receipt of the search report or durlna any 
chapter II procedure. 



